Multimedia picture generating method, device and electronic device

ABSTRACT

The present disclosure provides a multimedia picture generating method, device and electronic device, wherein the multimedia picture generating method comprises acquiring a picture of a photographed subject of a photographing device; extracting a figure image as a foreground image from the picture after receiving an instruction for removing picture background; performing voice recognition after receiving a voice command inputted by a user; searching out multimedia content that matches a user command information recognized by voice recognition from a multimedia database as background content for the picture; and generating a multimedia picture that contains the foreground image and the background content. Thus, when a user wants to replace the picture background, a figure image can be automatically extracted from the picture as a foreground image, and the original background with poor effect can be removed, then an image and/or music that matches a user command information can be automatically searched out from a multimedia database, which increases the search efficiency, simplifies the optimum processing and improves the user experience.

CROSS REFERENCE TO RELATED APPLICATIONS

This application is a continuation of International Application No.PCT/CN2016/085839 filed on Jun. 15, 2016, which is based upon and claimspriority to Chinese Patent Application No. 201510861520.6 filed on Nov.30, 2015, the entire content of which is incorporated herein byreference.

TECHNICAL FIELD

The present disclosure relates to the field of multimedia technology,and in particular relates to a multimedia picture generating method,device and electronic device.

BACKGROUND

Devices with photographing capability such as smart phones and tabletcomputers have become very good social media tools because of theirfunctions such as convenience of carrying and ability of quicklydisseminating information. Along with their increasing popularity, moreand more auxiliary functions have been added thereto, for instance, thephotographed picture can be edited and then sent to social mediaplatforms for sharing by means of social media software (e.g. Wechat,Microblog, QQ chat tool) built in the smart phone or tablet computer.

Currently, if a user wants to modify a picture background, he/she needsto use a picture editing software to perform analyzing, repairing,beautifying and synthesizing on the picture. But, since the pictureediting software is highly specialized and very difficult to operate, anon-professional user is required to learn and be familiar with thecorresponding software operation methods in order to attain asatisfactory effect; even for a professional user, picking out an imagethat matches his/her own demands from a vast amount of images alsorequires a lot of time, and therefore a multimedia picture that matcheshis/her own demands cannot be quickly generated, which causes poorexperience.

SUMMARY

Therefore, the present disclosure provides a multimedia picturegenerating method, device and electronic device that are able tosimplify the optimum processing of a picture and thus improve the userexperience.

One objective of the embodiments of the present disclosure is to providea multimedia picture generating method, comprising:

-   -   acquiring a picture of a photographed subject;    -   extracting a figure image as a foreground image from the picture        after receiving an instruction for removing picture background;    -   receiving a voice command inputted by a user, and performing        voice recognition;    -   searching out multimedia content that matches a user command        information recognized by voice recognition from a multimedia        database as background content for the picture;    -   generating a multimedia picture that contains the foreground        image and the background content.

Another objective of the embodiments of the present disclosure is toprovide a multimedia picture generating device comprising:

-   -   a photographing apparatus for taking a picture of a photographed        subject of a photographing device;    -   a multimedia picture generating apparatus, connected with the        photographing apparatus, for acquiring a picture sent from the        photographing apparatus, extracting a figure image as a        foreground image from the picture after receiving an instruction        for removing picture background, performing voice recognition        after receiving a voice command inputted by a user, searching        out multimedia content that matches a user command information        recognized by voice recognition from a multimedia database as        background content for the picture, and generating a multimedia        picture that contains the foreground image and the background        content;    -   a display apparatus, connected with the photographing apparatus        and the multimedia picture generating apparatus, for displaying        the picture and the multimedia picture;    -   a player apparatus, connected with the multimedia picture        generating apparatus, for playing the background content.

A further objective of the embodiments of the present disclosure is toprovide an electronic device, comprising at least one processor; and amemory communicably connected with the at least one processor forstoring instructions executable by the at least one processor, whereinexecution of the instructions by the at least one processor causes theat least one processor to: acquire a picture of a photographed subject;extract a figure image as a foreground image from the picture afterreceiving an instruction for removing picture background; receive avoice command inputted by a user, and performing voice recognition;search out multimedia content that matches a user command informationrecognized by voice recognition from a multimedia database as backgroundcontent for the picture; generate a multimedia picture that contains theforeground image and the background content.

A further objective of the embodiments of the present disclosure is toprovide a non-transitory computer-readable storage medium storingexecutable instructions that, when executed by an electronic device,cause the electronic device to: acquire a picture of a photographedsubject; extract a figure image as a foreground image from the pictureafter receiving an instruction for removing picture background; receivea voice command inputted by a user, and performing voice recognition;search out multimedia content that matches a user command informationrecognized by voice recognition from a multimedia database as backgroundcontent for the picture; generate a multimedia picture that contains theforeground image and the background content.

The embodiments of the present disclosure provides a multimedia picturegenerating method, device and electronic device which comprisesacquiring a picture of a photographed subject; extracting a figure imageas a foreground image from the picture after receiving an instructionfor removing picture background; performing voice recognition afterreceiving a voice command inputted by a user; searching out multimediacontent that matches a user command information recognized by voicerecognition from a multimedia database as background content for thepicture; and generating a multimedia picture that contains theforeground image and the background content. Thus, when a user wants toreplace the picture background, a figure image can he automaticallyextracted from the picture as a foreground image, and the originalbackground with poor effect can be removed, so that the user is notrequired to optimize the picture background by using a specializedpicture editing software and instead is only required to send aninstruction for removing picture background in order to replace thebackground with a satisfactory image, which simplifies the optimumprocessing and improves the user experience. A user command informationcan be acquired by performing voice recognition on user voice, so as toautomatically search out multimedia content that matches the usercommand information from a multimedia database, and thus the user is notrequired to manually pick out multimedia content that matches his/herown demand from a vast amount of images and/or music, which increasesthe search efficiency. After that, a multimedia picture that containsthe foreground image and the background content is generated, so that aperfect combination of visual and audial effects can be attained.

BRIEF DESCRIPTION OF THE DRAWINGS

One or more embodiments are illustrated by way of example, and not bylimitation, in the figures of the accompanying drawings, whereinelements having the same reference numeral designations represent likeelements throughout. The drawings are not to scale, unless otherwisedisclosed.

FIG. 1 is a flow chart of a specific example of the multimedia picturegenerating method in Embodiment 1 of the present disclosure;

FIG. 2 is a flow chart of a specific example of searching out an imageand/or music that matches a user command information from a multimediadatabase in the multimedia picture generating method in Embodiment 1 ofthe present disclosure;

FIG. 3 is a block diagram of a specific example of the multimediapicture generating apparatus in Embodiment 2 of the present disclosure;

FIG. 4 is a block diagram of a specific example of the multimediapicture generating device in Embodiment 3 of the present disclosure;

FIG. 5 is a schematic diagram of the hardware configuration of theelectronic device in Embodiment 5 of the present disclosure, whichperforms the multimedia picture generating method.

REFERENCE NUMERALS

1—original photo acquiring unit; 2—foreground image extracting unit;3—command recognition unit; 4—matching unit; 5—generating unit;11—photographing apparatus; 12—multimedia picture generating apparatus;13—display apparatus; 14—player apparatus.

DETAILED DESCRIPTION OF EMBODIMENTS

In order to clearly describe objectives, the technical solutions andadvantages of the present disclosure, a clear and complete descriptionof the technical solutions in the present disclosure will be givenbelow, in conjunction with the accompanying drawings in the embodimentsof the present disclosure. Apparently, the embodiments described beloware a part, but not all, of the embodiments of the present disclosure.

Embodiment 1

The present embodiment provides a multimedia picture generating methodthat is used for a photographing device such as an electronic devicewith photographing capability including a cell phone, a pad, a notebookcomputer. As shown in FIG. 1, the method comprises the following steps:

S1, acquiring a picture of a photographed subject of the photographingdevice.

S2, extracting a figure image as a foreground image from the pictureafter receiving an instruction for removing picture background.

Specifically, contour analysis method is used to determine the figureimage in the picture and thus extract the figure image as a foregroundimage. Of course, any method in prior art that is able to extract afigure image from a picture can be used to implement the aforementionedoperation.

S3, receiving a voice command inputted by a user, and performing voicerecognition. Specifically, any software in prior art with voicerecognition function can be used to implement the function of voicerecognition.

S4, searching out multimedia content that matches a user commandinformation recognized by voice recognition from a multimedia databaseas background content for the picture. Multimedia content includestexts, images, sounds, animations and videos which can be integratedwith a picture as background content. The preferable embodiments of thepresent disclosure take images and/or music as examples and thepreferable background content includes background image and backgroundmusic. Specifically, a user might not be satisfied with thephotographing environment when taking a picture, for instance, when theuser takes a picture at home but wants to replace the picture backgroundwith a scene of a place of historic interest and scenic beauty or ascene of a place having excellent scenery, e.g. wants to replace thepicture background with a scene of the Eiffel Tower or the Fuji Mountainin Japan, the user makes a voice containing a content about the EiffelTower or the Fuji Mountain, then by performing voice recognition on theuser voice, a user command information can be acquired to set an imagecontaining a scene of the Eiffel Tower or the Fuji Mountain as a newbackground image for the picture; for another instance, when the userwants to add some favorite music into the picture, e.g. wants to add afavorite pop song, the user directly speaks the name of this pop song,then by voice recognition, a user command information can be acquired toset this pop song as background music for the picture. According to theaforementioned user command information, an image and/or music thatmatches the user command information is automatically searched out froma multimedia database and set as a background image and/or backgroundmusic for the picture, so as to ensure that the image and music set asthe background image and background music meet the real demands of theuser, thereby improving the user experience.

S5, generating a multimedia picture that contains the foreground imageand the background content.

By means of the multimedia picture generating method in this embodiment,when a user wants to replace the picture background, a figure image canbe automatically extracted from the picture as a foreground image, andthe original background with poor effect can be removed, so that theuser is not required to optimize the picture background by using aspecialized picture editing software and instead is only required tosend an instruction for removing picture background in order to replacethe background with a satisfactory image, which simplifies the optimumprocessing and improves the user experience. A user command informationcan be acquired by performing voice recognition on user voice, so as toautomatically search out multimedia content that matches the usercommand information from a multimedia database, and thus the user is notrequired to manually pick out multimedia content that matches his/herown demand from a vast amount of images and/or music, which increasesthe search efficiency. After that, a multimedia picture that containsthe foreground image and the background content is generated, so that aperfect combination of visual and audial effects can be attained.

Optionally, as shown in FIG. 2, the step S4 comprises:

S41, searching out an image and/or music corresponding to a keywordindex identical or similar to a keyword from the multimedia databasehaving keyword indexes corresponding to images and music stored thereinby using the user command information as the keyword. Specifically,music and images are stored with keyword indexes into the multimediadatabase so as to facilitate later searches. A common keyword indexingmeans is by using TIF formatted information, then by reading the TIFinformation of images and music, the content presented by the images andmusic can be known, which is very convenient. If the keyword indexcorresponding to an image and/or music is more similar to the usercommand information keyword, it indicates that the content presented bythe image and/or music is more close to the real demand of the user,with a higher matching degree. By searching out an image and/or musiccorresponding to a keyword index identical or similar to the usercommand information keyword, the image and/or music with the highestmatching degree in relation to the user demand can be ensured to bepushed to the user;

S42, setting the image and/or music corresponding to the keyword indexidentical or similar to the keyword as the multimedia content thatmatches the user command information; and

S43, if an instruction for re-searching images and/or music is received,searching out an image and/or music other than the images and/or musicalready searched out and corresponding to the keyword index identical orsimilar to the keyword from the multimedia database by using the usercommand information as the keyword, until no further instruction forre-searching images and/or music is received; then returning to the stepS42, and setting the image and/or music finally searched out andcorresponding to the keyword index identical or similar to the keywordas the multimedia content that matches the user command information.

The multimedia picture generating method in this embodiment ensures thatthe image and music finally selected as the background content alwayssatisfies the user, which lays a foundation for generating a multimediapicture containing the foreground image and the background content thathas a high user satisfaction, thereby improving the user experience.

Optionally, the multimedia picture generating method in this embodimentperforms the steps of determining if an instruction for removing picturebackground is received and determining if an instruction forre-searching images and/or music is received by recognizing user voice,so that the user is not required to manually input an instruction, andby recognizing a voice made by the user, a corresponding instruction canbe acquired and a corresponding operation can be carried out accordingto the instruction, so that both hands of the user are freed, the userdemands can be quickly responded to, and the user experience isimproved.

Embodiment 2

The present embodiment provides a multimedia picture generatingapparatus, used for a photographing device, comprising:

an original photo acquiring unit 1 for acquiring a picture of aphotographed subject of the photographing device;

a foreground image extracting unit 2 for extracting a figure image as aforeground image from the picture after receiving an instruction forremoving picture background;

a command recognition unit 3 for receiving a voice command inputted by auser and performing voice recognition;

a matching unit 4 for searching out multimedia content that matches auser command information recognized by voice recognition from amultimedia database as background content for the picture; and

a generating unit 5 for generating a multimedia picture that containsthe foreground image and the background content.

By means of the multimedia picture generating apparatus in thisembodiment, when a user wants to replace the picture background, afigure image can be automatically extracted from the picture as aforeground image, and the original background with poor effect can beremoved, so that the user is not required to optimize the picturebackground by using a specialized picture editing software and insteadis only required to send an instruction for removing picture backgroundin order to replace the background with a satisfactory image, whichsimplifies the optimum processing and improves the user experience. Auser command information can be acquired by performing voice recognitionon user voice, so as to automatically search out multimedia content thatmatches the user command information from a multimedia database, andthus the user is not required to manually pick out multimedia contentthat matches his/her own demands from a vast amount of images and/ormusic, which increases the search efficiency. After that, a multimediapicture that contains the foreground image and the background content isgenerated, so that a perfect combination of visual and audial effectscan be attained.

Optionally, the matching unit 4 is for searching out an image and/ormusic corresponding to a keyword index identical or similar to a keywordfrom the multimedia database having keyword indexes corresponding toimages and music stored therein by using the user command information asthe keyword; and if no instruction for re-searching images and/or musicis received, setting the image and/or music corresponding to the keywordindex identical or similar to the keyword as the multimedia content thatmatches the user command information.

Optionally, the matching unit 4 is further for searching out an imageand/or music other than the images and/or music already searched out andcorresponding to the keyword index identical or similar to the keywordfrom the multimedia database by using the user command information asthe keyword if an instruction for re-searching images and/or music isreceived, until no further instruction for re-searching images and/ormusic is received; and setting the image and/or music finally searchedout and corresponding to the keyword index identical or similar to thekeyword as the multimedia content that matches the user commandinformation.

The multimedia picture generating apparatus in this embodiment ensuresthat the image and music finally selected as the background contentalways satisfies the user, which lays a foundation for generating amultimedia picture containing the foreground image and the backgroundcontent that has a high user satisfaction, thereby improving the userexperience.

Optionally, the command recognition unit 3 is for determining if aninstruction for removing picture background is received and determiningif an instruction for re-searching images and/or music is received byrecognizing user voice, so that the user is not required to manuallyinput an instruction, and by recognizing a voice made by the user, acorresponding instruction can be acquired and a corresponding operationcan be carried out according to the instruction, so that both hands ofthe user are freed, the user demand can be quickly responded to, and theuser experience is improved.

Embodiment 3

The present embodiment provides a multimedia picture generating devicecomprising: a photographing apparatus 11, a multimedia picturegenerating apparatus 12, a display apparatus 13, and a player apparatus14.

The photographing apparatus 11 is for taking a picture of a photographedsubject of a photographing device. Specifically, the photographingapparatus 11 may comprise components such as a camera head, aflashlight.

The multimedia picture generating apparatus 12 is connected with thephotographing apparatus 11 and is for acquiring a picture sent from thephotographing apparatus 11, extracting a figure image as a foregroundimage from the picture after receiving an instruction for removingpicture background, performing voice recognition after receiving a voicecommand inputted by a user, searching out multimedia content thatmatches a user command information recognized by voice recognition froma multimedia database as background content for the picture, andgenerating a multimedia picture that contains the foreground image andthe background content. Specifically, the multimedia picture generatingapparatus 12 may be a controller with built-in programs that can performthe steps in Embodiment 1, so as to achieve automatic optimum processingof a picture. The multimedia picture generating apparatus 12 has accessto a multimedia database in the memory of the photographing device, andalso has access to a multimedia database stored in devices other thanthe photographing device, which provides technical support for searchingout an image and/or music that satisfies the user demand.

The display apparatus 13 is connected with the photographing apparatus11 and the multimedia picture generating apparatus 12, and is fordisplaying the picture and the multimedia picture. Specifically, thedisplay apparatus 13 may he a display screen which presents the picturethat has been photographed to the user in order for the user to timelydetermine whether the background thereof needs to be removed, presentsthe image that has been searched out to the user in order for the userto timely determine if the image needs to be re-matched, and also allowsthe user to instantaneously enjoy the multimedia picture that has beenoptimized and generated.

The player apparatus 14 is connected with the multimedia picturegenerating apparatus 12 and is for playing the background content.Specifically, the player apparatus 14 may be a micro-electronicloudspeaker which plays the music that has been searched out to the userin order for the user to timely determine whether the music needs to bere-matched, and also allows the user to instantaneously enjoy the musicin the background content of the multimedia picture.

By means of the multimedia picture generating device in this embodiment,when a user wants to replace the picture background, a figure image canbe automatically extracted from the picture as a foreground image, andthe original background with poor effect can be removed, so that theuser is not required to optimize the picture background by using aspecialized picture editing software and instead is only required tosend an instruction for removing picture background in order to replacethe background with a satisfactory image, which simplifies the optimumprocessing and improves the user experience. A user command informationcan be acquired by performing voice recognition on user voice, so as toautomatically search out an image and/or music that matches the usercommand information from a multimedia database, and thus the user is notrequired to manually pick out an image and music that matches his/herown demands from a vast amount of images and music, which increases thesearch efficiency. After that, a multimedia picture that contains theforeground image and the background content is generated, so that aperfect combination of visual and audial effects can be attained. Theuser is also not required to manually input an instruction, and byrecognizing a voice made by the user, a corresponding instruction can beacquired and a corresponding operation can be carried out according tothe instruction, so that both hands of the user are freed, the userdemands can be quickly responded to, and the user experience isimproved.

Embodiment 4

The present embodiment provides a cell phone comprising the multimediapicture generating apparatus in Embodiment 2 or comprising themultimedia picture generating device in Embodiment 3.

By means of the cell phone in this embodiment, when a user wants toreplace the picture background, a figure image can be automaticallyextracted from the picture as a foreground image, and the originalbackground with poor effect can be removed, so that the user is notrequired to optimize the picture background by using a specializedpicture editing software and instead is only required to send aninstruction for removing picture background in order to replace thebackground with a satisfactory image, which simplifies the optimumprocessing and improves the user experience. A user command informationcan be acquired by performing voice recognition on user voice, so as toautomatically search out multimedia content that matches the usercommand information from a multimedia database, and thus the user is notrequired to manually pick out an image and/or music that matches his/herown demand from a vast amount of images and/or music, which increasesthe search efficiency. After that, a multimedia picture that containsthe foreground image and the background content is generated, so that aperfect combination of visual and audial effects can be attained. Theuser is also not required to manually input an instruction, and byrecognizing a voice made by the user, a corresponding instruction can beacquired and a corresponding operation can be carried out according tothe instruction, so that both hands of the user are freed, the userdemand can be quickly responded to, and the user experience is improved.

Embodiment 5

FIG. 5 is a schematic diagram of the hardware configuration of theelectronic device provided by the present embodiment, which performs themultimedia picture generating method. As shown in FIG. 5, the electronicdevice includes: one or more processors 510 and a memory 520, whereinone processor 510 is shown in FIG. 5 as an example. The electronicdevice that performs the multimedia picture generating method furthercomprises: an input apparatus 530 and an output apparatus 540.

The processor 510, the memory 520, the input apparatus 530 and theoutput apparatus 540 may be connected via a bus line or other means,wherein connection via a bus line is shown in FIG. 5 as an example.

The memory 520 is a non-transitory computer-readable storage medium thatcan be used to store non-transitory software programs, non-transitorycomputer-executable programs and modules, such as the programinstructions/modules corresponding to the multimedia picture generatingmethod of the embodiments of the present disclosure (e.g. the originalphoto acquiring unit 1, the foreground image extracting unit 2, thecommand recognition unit 3, the matching unit 4 and the generating unit5 shown in FIG. 3). The processor 510 executes the non-transitorysoftware programs, instructions and modules stored in the memory 520 soas to perform various function application and data processing of theserver, thereby implementing the multimedia picture generating method ofthe above-mentioned method embodiments

The memory 520 includes a program storage area and a data storage area,wherein, the program storage area can store an operation system andapplication programs required for at least one function; the datastorage area can store data generated by use of the multimedia picturegenerating device. Furthermore, the memory 520 may include a high-speedrandom access memory, and may also include a non-volatile memory, e.g.at least one magnetic disk memory unit, flash memory unit, or othernon-volatile solid-state memory unit. In some embodiments, optionally,the memory 520 includes a remote memory accessed by the processor 510,and the remote memory is connected to the multimedia picture generatingdevice via network connection. Examples of the aforementioned networkinclude but not limited to internet, intranet, LAN, GSM, and theircombinations.

The input apparatus 530 receives digit or character information, so asto generate signal input related to the user configuration and functioncontrol of the multimedia picture generating device. The outputapparatus 540 includes display devices such as a display screen.

The one or more modules are stored in the memory 520 and, when executedby the one or more processors 510, perform the multimedia picturegenerating method of any one of the above-mentioned method embodiments.

The above-mentioned product can perform the method provided by theembodiments of the present disclosure and have function modules as wellas beneficial effects corresponding to the method. Those technicaldetails not described in this embodiment can be known by referring tothe method provided by the embodiments of the present disclosure.

The electronic device of the embodiments of the present disclosure canexist in many forms, including but not limited to:

(1) Mobile communication devices: The characteristic of this type ofdevice is having a mobile communication function with a main goal ofenabling voice and data communication. This type of terminal deviceincludes: smartphones (such as iPhone), multimedia phones, featurephones, and low-end phones.

(2) Ultra-mobile personal computer devices: This type of device belongsto the category of personal computers that have computing and processingfunctions and usually also have mobile internet access features. Thistype of terminal device includes: PDA, MID, UMPC devices, such as iPad.

(3) Portable entertainment devices: This type of device is able todisplay and play multimedia contents. This type of terminal deviceincludes: audio and video players (such as iPod), handheld game players,electronic books, intelligent toys, and portable GPS devices.

(4) Servers: devices providing computing service. The structure of aserver includes a processor, a hard disk, an internal memory, a systembus, etc. A server has an architecture similar to that of a generalpurpose computer, but in order to provide highly reliable service, aserver has higher requirements in aspects of processing capability,stability, reliability, security, expandability, manageability.

(5) Other electronic devices having data interaction function.

Embodiment 6

The present embodiment provides a non-transitory computer-readablestorage medium storing executable instructions that, when executed by anelectronic device, cause the electronic device to perform the multimediapicture generating method of any one of the above-mentioned methodembodiments.

The above-mentioned device embodiments are only illustrative, whereinthe units described as separate parts may be or may not be physicallyseparated, the component shown as a unit may be or may not be a physicalunit, i.e. may be located in one place, or may be distributed atmultiple network units. According to actual requirements, part of or allof the modules may be selected to attain the purpose of the technicalscheme of the embodiments.

By reading the above-mentioned description of embodiments, those skilledin the art can clearly understand that the various embodiments may beimplemented by means of software plus a general hardware platform, orjust by means of hardware. Based on such understanding, theabove-mentioned technical scheme in essence, or the part thereof thathas a contribution to related prior art, may he embodied in the form ofa software product, and such a software product may be stored in acomputer-readable storage medium such as ROM/RAM, magnetic disk oroptical disk, and may include a plurality of instructions to cause acomputer device (which may be a personal computer, a server, or anetwork device) to execute the methods described in the variousembodiments or in some parts thereof.

Finally, it should be noted that: The above-mentioned embodiments aremerely illustrated for describing the technical scheme of the presentdisclosure, without restricting the technical scheme of the presentdisclosure. Although detailed description of the present disclosure isgiven with reference to the above-mentioned embodiments, those skilledin the art should understand that they still can modify the technicalscheme recorded in the above-mentioned various embodiments, orsubstitute part of the technical features therein with equivalents.These modifications or substitutes would not cause the essence of thecorresponding technical scheme to deviate from the concept and scope ofthe technical scheme of the various embodiments of the presentdisclosure.

What is claimed is:
 1. A multimedia picture generating method,comprising: acquiring a picture of a photographed subject; extracting afigure image as a foreground image from the picture after receiving aninstruction for removing picture background; receiving a voice commandinputted by a user, and performing voice recognition; searching outmultimedia content that matches a user command information recognized byvoice recognition from a multimedia database as background content forthe picture; generating a multimedia picture that contains theforeground image and the background content.
 2. The multimedia picturegenerating method of claim 1, wherein, searching out multimedia contentthat matches the user command information recognized by voicerecognition from the multimedia database as background content for thepicture comprises: searching out an image and/or music corresponding toa keyword index identical or similar to a keyword from the multimediadatabase having keyword indexes corresponding to images and music storedtherein by using the user command information as the keyword; andsetting the image and/or music corresponding to the keyword indexidentical or similar to the keyword as the multimedia content thatmatches the user command information.
 3. The multimedia picturegenerating method of claim 2, wherein, after searching out an imageand/or music corresponding to the keyword index identical or similar tothe keyword from the multimedia database by using the user commandinformation as the keyword, the method further comprises: if aninstruction for re-searching images and/or music is received, searchingout an image and/or music other than the images and/or music alreadysearched out and corresponding to the keyword index identical or similarto the keyword from the multimedia database by using the user commandinformation as the keyword, until no further instruction forre-searching images and/or music is received; setting the image and/ormusic finally searched out and corresponding to the keyword indexidentical or similar to the keyword as the multimedia content thatmatches the user command information.
 4. The multimedia picturegenerating method of claim 1, wherein, judging whether an instructionfor removing picture background is received and judging whether aninstruction for re-searching images and/or music is received areperformed by recognizing user voice.
 5. A multimedia picture generatingdevice, comprising: a photographing apparatus (11) for taking a pictureof a photographed subject of a photographing device; a multimediapicture generating apparatus (12), connected with the photographingapparatus (11), for acquiring a picture sent from the photographingapparatus (11), extracting a figure image as a foreground image from thepicture after receiving an instruction for removing picture background,performing voice recognition after receiving a voice command inputted bya user, searching out multimedia content that matches a user commandinformation recognized by voice recognition from a multimedia databaseas background content for the picture, and generating a multimediapicture that contains the foreground image and the background content; adisplay apparatus (13), connected with the photographing apparatus (11)and the multimedia picture generating apparatus (12), for displaying thepicture and the multimedia picture; a player apparatus (14), connectedwith the multimedia picture generating apparatus (12), for playing thebackground content.
 6. An electronic device, comprising: at least oneprocessor; and a memory communicably connected with the at least oneprocessor for storing instructions executable by the at least oneprocessor, wherein execution of the instructions by the at least oneprocessor causes the at least one processor to: acquire a picture of aphotographed subject; extract a figure image as a foreground image fromthe picture after receiving an instruction for removing picturebackground; receive a voice command inputted by a user, and performingvoice recognition; search out multimedia content that matches a usercommand information recognized by voice recognition from a multimediadatabase as background content for the picture; generate a multimediapicture that contains the foreground image and the background content.7. The electronic device of claim 6, wherein, searching out multimediacontent that matches the user command information recognized by voicerecognition from the multimedia database as background content for thepicture comprises: searching out an image and/or music corresponding toa keyword index identical or similar to a keyword from the multimediadatabase having keyword indexes corresponding to images and music storedtherein by using the user command information as the keyword; andsetting the image and/or music corresponding to the keyword indexidentical or similar to the keyword as the multimedia content thatmatches the user command information.
 8. The electronic device of claim7, wherein, after searching out an image and/or music corresponding tothe keyword index identical or similar to the keyword from themultimedia database by using the user command information as thekeyword, the at least one processor is further caused to: if aninstruction for re-searching images and/or music is received, search outan image and/or music other than the images and/or music alreadysearched out and corresponding to the keyword index identical or similarto the keyword from the multimedia database by using the user commandinformation as the keyword, until no further instruction forre-searching images and/or music is received; set the image and/or musicfinally searched out and corresponding to the keyword index identical orsimilar to the keyword as the multimedia content that matches the usercommand information.
 9. The electronic device of claim 6, wherein,judging whether an instruction for removing picture background isreceived and judging whether an instruction for re-searching imagesand/or music is received are performed by recognizing user voice.
 10. Anon-transitory computer-readable storage medium storing executableinstructions that, when executed by an electronic device, cause theelectronic device to: acquire a picture of a photographed subject;extract a figure image as a foreground image from the picture afterreceiving an instruction for removing picture background; receive avoice command inputted by a user, and performing voice recognition;search out multimedia content that matches a user command informationrecognized by voice recognition from a multimedia database as backgroundcontent for the picture; generate a multimedia picture that contains theforeground image and the background content.
 11. The non-transitorycomputer-readable storage medium of claim 10, wherein, searching outmultimedia content that matches the user command information recognizedby voice recognition from the multimedia database as background contentfor the picture comprises: searching out an image and/or musiccorresponding to a keyword index identical or similar to a keyword fromthe multimedia database having keyword indexes corresponding to imagesand music stored therein by using the user command information as thekeyword; and setting the image and/or music corresponding to the keywordindex identical or similar to the keyword as the multimedia content thatmatches the user command information.
 12. The non-transitorycomputer-readable storage medium of claim 11, wherein, after searchingout an image and/or music corresponding to the keyword index identicalor similar to the keyword from the multimedia database by using the usercommand information as the keyword, the electronic device is furthercaused to: if an instruction for researching images and/or music isreceived, search out an image and/or music other than the images and/ormusic already searched out and corresponding to the keyword indexidentical or similar to the keyword from the multimedia database byusing the user command information as the keyword, until no furtherinstruction for re-searching images and/or music is received; set theimage and/or music finally searched out and corresponding to the keywordindex identical or similar to the keyword as the multimedia content thatmatches the user command information.
 13. The non-transitorycomputer-readable storage medium of claim 10, wherein, judging whetheran instruction for removing picture background is received and judgingwhether an instruction for re-searching images and/or music is receivedare performed by recognizing user voice.